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FIX$3 SAME (HTML MARKUP SGML 
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USPAT 


OR 


ON 


2007/05/25 09:23 


L7 
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((DTD (DOCUMENT ADJ TYPE ADJ 
DEFINITION)) with edit$3) 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 
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ON 


2007/05/25 09:24 


L8 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB
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ON 


2007/05/25 09:24 


L9 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


SAME 


ON 


2007/05/25 09:24 


L10 


243 


xml pars$3 dtd valid$6 


US-PGPUB; 
USPAT; 
EPO; JPO; 
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SAME 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
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2007/05/25 09:27 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
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SAME 
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2007/05/25 09:27 


L15 


9 


correct$3 malform$3 (expression or 
statement or tag) 


US-PGPUB; 
USPAT; 
EPO; JPO; 
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USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 
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ON 
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((EXTRACT$3 OR RETRIEV$3) 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
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SAME 


ON 
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((CORRECT$3 OR FIX$3 OR 
EDIT$3) DOCUMENT((DTD) OR 
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US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


SAME 


ON 


2007/05/25 09:38 
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((CONVER$6 OR TRANSFORM$3) 
DOCUMENT((DTD) OR (DOCUMENT 
ADJ TYPE ADJ DEFINITION)) 
SELECT$3 ((WELL ADJ FORMED) 
OR MALFORMED)).CLM. 


US-PGPUB; 
USPAT; 
EPO; JPO; 
DERWENT; 
IBM_TDB 


SAME 


ON 


2007/05/25 09:38 
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al. 








2 




US 
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stylesheet 

designs 

using 

meta-tag 

and/ or 

associated 
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Kim; Hong 
J. et al. 
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Bl 


20060808


Method and apparatus for incremental computation of the accuracy of a categorization-by-example system


707/5


Davis; Mark W. et al.
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US 

7080083 B2


20060718


Extensible stylesheet designs in visual graphic environments


Kim; Hong J. et al.
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Bl 


20060411


System and method for construction, storage, and transport of presentation-independent multi-media content


715/500.


Baru; Chaitanya et al.
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US 

6941307 B2


20050906


Arrangement and a method relating to session management in a portal structure


707/10


Papanikolaou; Thomas et al.
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US 

6938079 B1


20050830


System and method for automatically configuring a client device


709/222


Anderson; Mark et al.
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US 

6934740 B1


20050823


Method and apparatus for sharing common data objects among multiple applications in a client device


709/213


Lawande; Sachin et al.
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20050809 


Computer 
system, 
method, 
and 
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business 

communications


Eggebraaten; Thomas John et al.
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6882995 
B2 


20050419


Automatic query and transformative process


707/3


Nasr; Roger I. et al.








11 




US 

6882892 B2


20050419


System and method for specifying elements in a packaging process


700/97


Farrah; Timothy Francis et al.








12 




US 

6789252 
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20040907 


Building business objects and business software applications using dynamic object definitions of

ingredient 
ial 

objects 


717/100 


Burke; Miles D. et al.








13 
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6606620 
B1


20030812


Method and system for classifying semi-structured documents


707/3


Sundaresan; Neelakantan et al.
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US 

6584459 
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20030624 


Database extender for storing, querying, and retrieving structured documents


707/3


Chang; Daniel T. et al.
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US 
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20030211 


Method and apparatus for indexing structured documents with rich data types


707/10


Cheng; Josephine M. et al.
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US 

6438540 
B2 
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Automatic 
query and transformative process


707/3


Nasr; Roger I. et al.
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US 

6421656 
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20020716 


Method and apparatus for creating structure indexes for a data base extender


707/2


Cheng; Josephine M. et al.
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6366934 
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20020402 


Method and apparatus for querying structured documents using a database extender


707/513


Cheng; Josephine M. et al.
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20010717 


System and method for query processing of structured documents


707/5


Nasr; Roger I. et al.
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US 

6083276 
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20000704 


Creating and configuring component-based applications using a text-based descriptive attribute grammar


717/1


Davidson; Harold R. et al.









5/25/2007, EAST Version: 2.1.0.14 





1 


Document ID


Issue Date


Title


Current OR


Inventor

3 


4 


5 


1 




US 

20070112675 A1


20070517


Goods and Services Locator Language for Uniform Resource Identifier Components


705/50


Flinn; Brenda Jo
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US 

20070079235 A1


20070405


DYNAMIC CREATION OF AN APPLICATION'S XML DOCUMENT TYPE DEFINITION (DTD)


715/513


Bender; David Michael et al.








21 




US 

7194402 B2


20070320


Method and system for converting files to a specified markup language


704/3


Poplawski; Laura J.








22 




US 

7143343 B2


20061128


Dynamic creation of an application's XML document type definition (DTD)


715/513


Bender; David Michael et al.
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US 

20060190575 A1


20060824


Method and apparatus for provisioning network devices using instructions in extensible markup language


709/222


Harvey; Andrew et al.








2 




US 

20060015489 A1


20060119


Digital asset data type definition
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707/3 


Probst; 
Bruce E. et 
al. 








3 




US 

20030140034 A1


20030724


Digital asset data type definitions


707/3


Probst, Bruce E. et al.








4 




US 

20030106025 A1


20030605


Method and system for providing XML-based web pages for non-pc information terminals


715/523


Cho, Soo Sun et al.
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US 

20020152244 A1


20021017


Method and apparatus to dynamically create a customized user interface based on a document type definition


715/530


Dean, Sara Elo et al.
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2002002 
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System and 
method for creating a source document and presenting the source document to a user in a target format


715/523


Kutay, Ali et al.
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20010830 


Method and system for generating a display rule for a structured document, storage medium for storing a program
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method and 
system for changing a structured document and its document type definition, and storage medium for storing a program therefor


707/513


Hori, Masahiro et al.
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US 

7209917 B2


20070424


Digital asset data type definition
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707/3 


Probst; 
Bruce E. et 
al. 
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US 

7200809 B1


20070403


Multi-device support for mobile applications using XML


715/517


Paul; Jyotirmoy et al.








10 




US 

7200597 B1


20070403


Graphic search initiation


707/10


Grizzard; Michael R.








11 




US 

7171685 B2


20070130


Standard format specification for automatically configuring IP security tunnels


726/14


Batra; Gaurav et al.








12 




US 

6950984 B2


20050927


Method, system for, and program product for generating a display rule for a structured document, and for changing a structured document and its document type definition


715/513


Hori; Masahiro et al.
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US 

6611843 B1


20030826


Specification of sub-elements and attributes in an XML sub-tree and method for extracting data values therefrom


707/102


Jacobs; Ronald Michael
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US 

20030188036 A1


20031002


Methods and systems for program migration


719/310


Chen, Luojia et al.
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US 

20020147747 A1


20021010


System for converting data to a markup language


715/513


Zaharkin, Michael S.








3 




US 

20020100027 A1


20020725


Method of externalizing relational and ASN.1-formatted data into XML format


717/137


Binding, Carl et al.








4 




US 

20020026461 A1


20020228


System and method for creating a source document and presenting the source document to a user in a target format


715/523


Kutay, Ali et al.








5 




US 

7209898 B2


20070424


XML instrumentation interface for tree-based monitoring architecture


705/51


Pfeiffer; Stephen et al.








6 




US 

7200809 B1


20070403


Multi-device support for mobile applications using XML


715/517


Paul; Jyotirmoy et al.
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US 

7107522 B1


20060912


System and method for creating extensible content


715/513


Morgan; Kelly C. et al.








8 




US 

6941306 B2


20050906


Method and system for accessing data by using SOAP-XML


707/10


Kim; Hyoung Sun
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X 


US 

20020002566 A1


20020103


TRANSFORMATION OF MARKED UP DOCUMENTS USING A BASE ARCHITECTURE


707/513


GAJRAJ, COLIN












Results (page 1): CORRECT??? <PARAGRAPH> MALFORMED <PARAGRA. . . Page 1 of 6 



Terms used: CORRECT??? PARAGRAPH MALFORMED PARAGRAPH EXPRESSION PARAGRAPH DTD

Found 12,274 of 201,798






Results 1 - 20 of 200 
Best 200 shown 






1 aTool: creating validated XML documents on the fly using MS word 
Oliver Meyer 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation SIGDOC '02 

Publisher: ACM Press 

Full text available: pdf(239.02 KB) Additional Information: full citation, abstract, references, index terms

This paper describes aTool, an extension to Microsoft's Word to create XML documents. 
aTool has been developed in a joint project of the publisher Springer Verlag, Technical 
University of Munich (TUM), and Technical University of Aachen (RWTH). It has been 
developed to provide Springer Verlag with uniform XML documents from its authors and 
has become a generic XML creation tool that can be adapted to different document 
structures. For an author, aTool derives XML structures from MS Word editing c ... 

Keywords: A-Posteriori Integration, DOM, Microsoft Office, Microsoft Word, Microsoft 
Word Add-In, XML, character formatting 



2 Querying structured documents with hypertext links using OODBMS
V. Christophides, A. Rizk 

September 1994 Proceedings of the 1994 ACM European conference on Hypermedia 
technology ECHT '94 

Publisher: ACM Press 

Full text available: pdf(1.32 MB) Additional Information: full citation, abstract, references, citings, index terms

Hierarchical logical structure and hypertext links are complementary and can be combined 
to build more powerful document management systems. Previous work exploits this 
complementarity for building better document processors, browsers and editing tools, but 
not for building sophisticated querying mechanisms. Querying in hypertext has been a 
requirement since [19] and has already been elaborated in many hypertext systems, but 
has not yet been used for hypertext systems superimposed on an und ... 

Keywords: hypertexts, information retrieval, object oriented databases, path 
expressions, query languages, structured documents 



3 Document reuse and semantics: Towards a semantics for XML markup
Allen Renear, David Dubin, C. M. Sperberg-McQueen 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
DocEng '02 

Publisher: ACM Press

Full text available: pdf(72.89 KB) Additional Information: full citation, abstract, references, citings, index terms

Although XML Document Type Definitions provide a mechanism for specifying, in machine- 
readable form, the syntax of an XML markup language, there is no comparable mechanism 
for specifying the semantics of an XML vocabulary. That is, there is no way to characterize 
the meaning of XML markup so that the facts and relationships represented by the 
occurrence of XML constructs can be explicitly, comprehensively, and mechanically 
identified. This has serious practical and theoretical consequence ... 

Keywords: SGML, XML, knowledge representation, markup, semantics 



4 Developing and empirically evaluating robust explanation generators: the KNIGHT experiments
James C. Lester, Bruce W. Porter

March 1997 Computational Linguistics, volume 23 issue 1
Publisher: MIT Press

Full text available: pdf(2.64 MB) Additional Information: full citation, abstract, references, citings
Publisher Site

To explain complex phenomena, an explanation system must be able to select information 
from a formal representation of domain knowledge, organize the selected information into 
multisentential discourse plans, and realize the discourse plans in text. Although recent 
years have witnessed significant progress in the development of sophisticated 
computational mechanisms for explanation, empirical results have been limited. This paper 
reports on a seven-year effort to empirically study explanation ge ... 

5 Key management and key exchange: A temporal key management scheme for secure broadcasting of XML documents
Elisa Bertino, Barbara Carminati, Elena Ferrari

November 2002 Proceedings of the 9th ACM conference on Computer and
communications security CCS '02

Publisher: ACM Press

Full text available: pdf(242.89 KB) Additional Information: full citation, abstract, references, citings, index terms

Secure broadcasting of web documents is becoming a crucial need for many web-based 
applications. Under the broadcast document dissemination strategy a web document 
source periodically broadcasts (portions of) its documents to a possibly large community
of subjects, without the need of explicit subject requests. By secure broadcasting we mean
that the delivery of information to subjects must obey the access control policies of the
document source. Since different subjects may have the right to ... 

Keywords: XML, secure broadcasting, temporal key management 



6 The "HyTime": hypermedia/time-based document structuring language
Steven R. Newcomb, Neill A. Kipp, Victoria T. Newcomb
November 1991 Communications of the ACM, volume 34 issue 11

Publisher: ACM Press

Full text available: pdf(12.96 MB) Additional Information: full citation, references, citings, index terms



7 Legal knowledge bases 2: legislation: Automatic semantics extraction in law documents
C. Biagioli, E. Francesconi, A. Passerini, S. Montemagni, C. Soria

June 2005 Proceedings of the 10th international conference on Artificial intelligence
and law ICAIL '05

Publisher: ACM Press

Full text available: pdf(491.40 KB) Additional Information: full citation, abstract, references

Normative texts can be viewed as composed by formal partitions (articles, paragraphs, 
etc.) or by semantic units containing fragments of a regulation (provisions). Provisions can 
be described according to a metadata scheme which consists of provision types and their 
arguments. This semantic annotation of a normative text can make the retrieval of norms 
easier. The detection and description of the provisions according to the established 
metadata scheme is an analytic intellectual activity aiming ... 

8 Semantics of paragraphs 
Wlodek Zadrozny, Karen Jensen 

June 1991 Computational Linguistics, volume 17 issue 2 
Publisher: MIT Press 

Full text available: pdf(2.80 MB) Additional Information: full citation, abstract, references, citings

Publisher Site

We present a computational theory of the paragraph. Within it we formally define 
coherence, give semantics to the adversative conjunction "but" and to the Gricean maxim 
of quantity, and present some new methods for anaphora resolution. The theory precisely 
characterizes the relationship between the content of the paragraph and background 
knowledge needed for its understanding. This is achieved by introducing a new type of 
logical theory consisting of an object level, corresponding to the content ... 

9 Industrial and practical experience track paper session 1: Ranking definitions with supervised learning methods
Jun Xu, Yunbo Cao, Hang Li, Min Zhao

May 2005 Special interest tracks and posters of the 14th international conference on
World Wide Web WWW '05

Publisher: ACM Press

Full text available: pdf(328.82 KB) Additional Information: full citation, abstract, references, citings, index terms

This paper is concerned with the problem of definition search. Specifically, given a term, 
we are to retrieve definitional excerpts of the term and rank the extracted excerpts 
according to their likelihood of being good definitions. This is in contrast to the traditional 
approaches of either generating a single combined definition or simply outputting all 
retrieved definitions. Definition ranking is essential for the task. Methods for performing 
definition ranking are proposed in this paper, whi ... 

Keywords: classification, ordinal regression, search of definitions, text mining, web 
mining, web search 



10 Web engineering meets natural language processing: a vocal interface generation practice
Hendrik Macedo, Jacques Robin, Roberto Barros

December 2005 Proceedings of the 11th Brazilian Symposium on Multimedia and the
web WebMedia '05
Publisher: ACM Press

Full text available: pdf(132.64 KB) Additional Information: full citation, abstract, references, index terms

Today's trend towards ever more compact devices such as PDAs and mobile phones 
demands a more pervasive access manner. This paper describes an innovative mediator 
service architecture based on up-to-date web engineering standards to enable voice-based 
access to Web applications by means of voice portals and VoiceXML technologies. The core 
of the architecture is a Natural Language Generator that implements a pipeline of 
transformation rules. We show how Natural Language Generation technology can ... 

Keywords: mediator service architecture, natural language generator, spoken dialogue-driven web

11 XIRQL: An XML query language based on information retrieval concepts
Norbert Fuhr, Kai Großjohann

April 2004 ACM Transactions on Information Systems (TOIS), volume 22 issue 2
Publisher: ACM Press

Full text available: pdf(281.91 KB) Additional Information: full citation, abstract, references, citings, index terms

XIRQL ("circle") is an XML query language that incorporates imprecision and vagueness for 
both structural and content-oriented query conditions. The corresponding uncertainty is 
handled by a consistent probabilistic model. The core features of XIRQL are (1) document 
ranking based on index term weighting, (2) specificity-oriented search for retrieving the 
most relevant parts of documents, (3) datatypes with vague predicates for dealing with 
specific types of content and (4) structural vagueness f ... 

Keywords: Path algebra, XML, XQuery, probabilistic retrieval, ranked retrieval, vague 
predicates 

12 XML processing: Frontiers of tractability for typechecking simple XML transformations
Wim Martens, Frank Neven

June 2004 Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART
symposium on Principles of database systems PODS '04

Publisher: ACM Press

Full text available: pdf(229.46 KB) Additional Information: full citation, abstract, references, citings

Typechecking consists of statically verifying whether the output of an XML transformation 
is always conform to an output type for documents satisfying a given input type. We focus 
on complete algorithms which always produce the correct answer. We consider top-down 
XML transformations incorporating XPath expressions and abstract document types by 
grammars and tree automata. By restricting schema languages and transformations, we 
identify several practical settings for which typechecking is in pol ... 

13 Design: Proving the validity and accessibility of dynamic web-pages
R. G. Stone, J. Dhiensa

May 2004 Proceedings of the 2004 international cross-disciplinary workshop on Web
accessibility (W4A) W4A '04

Publisher: ACM Press

Full text available: pdf(58.16 KB) Additional Information: full citation, abstract, references, citings, index terms

If a static web-page is checked for accessibility and passes then all is well. However 
checking the accessibility of the output from a dynamic (scripted) web-page is like testing 
a program to find errors. However many times a test succeeds it is always possible that 
the program will produce bad output next time. What is needed is something closer to a 
proof of correctness. This paper describes a first attempt to provide a proof of validity for 
dynamic web-pages which can be extended to a proof 0 ... 

Keywords: accessibility, dynamic Web-pages, validity 
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Jan-Marco Bremer, Michael Gertz 

January 2006 The VLDB Journal — The International Journal on Very Large Data 

Bases, volume 15 issue 1
Publisher: Springer-Verlag New York, Inc. 

Full text available: ^ pdf(841.10 KB) Additional Information: full citation , abstract 

For querying structured and semistructured data, data retrieval and document retrieval 
are two valuable and complementary techniques that have not yet been fully integrated.
In this paper, we introduce integrated information retrieval (IIR), an XML-based retrieval
approach that closes this gap. We introduce the syntax and semantics of an extension of 
the XQuery language called XQuery/IR. The extended language realizes IIR and thereby 
allows users to formulate new kinds of queries by nesting rank ... 

Keywords: Data retrieval, Document retrieval, Index structures, Integrated information 
retrievals, Structural join, XML

November 1994 Proceedings of the third international conference on Information and
knowledge management CIKM '94 
Publisher: ACM Press 

Full text available: pdf(1.03 MB) Additional Information: full citation, abstract, references, citings, index terms

An open hypermedia-document storage system has to meet requirements that are not 
satisfied by existing systems: it has to support non-generic hypermedia document types, 
i.e. document types enriched with application-specific semantics. It has to provide 
hypermedia-document access methods. Finally, it has to allow the exchange of 
hypermedia documents with other systems. On a technical level, an object-oriented 
database-management system, on a logical level, a well established ISO standard, na ... 

17 extended cumulated gain measures for the evaluation of content-oriented XML retrieval
Gabriella Kazai, Mounia Lalmas

October 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 4

Publisher: ACM Press

Full text available: pdf(3.25 MB) Additional Information: full citation, abstract, references, index terms

We propose and evaluate a family of measures, the extended Cumulated Gain (XCG) 
measures, for the evaluation of content-oriented XML retrieval approaches. Our aim is to 
provide an evaluation framework that allows the consideration of dependency among XML 
document components. In particular, two aspects of dependency are considered: (1) near- 
misses, which are document components that are structurally related to relevant 
components, such as a neighboring paragraph or container section, and (2) over ... 

Keywords: INEX, XML retrieval, cumulated gain, dependency, evaluation, metrics, near- 
miss, overlap 
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terms 

One of the key benefits of XML is its ability to represent a mix of structured and 
unstructured (text) data. Although current XML query languages such as XPath and 
XQuery can express rich queries over structured data, they can only express very 
rudimentary queries over text data. We thus propose TeXQuery, which is a powerful full- 
text search extension to XQuery. TeXQuery provides a rich set of fully composable full-text 
search primitives, such as Boolean connectives, phrase matching, proximity di ...

Keywords: full-text search, xquery 



19 Preparing heterogeneous XML for full-text search
Miro Lehtonen

October 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 4
Publisher: ACM Press

Full text available: pdf(228.25 KB) Additional Information: full citation, abstract, references, index terms

XML retrieval is facing new challenges when applied to heterogeneous XML documents, 
where next to nothing about the document structure can be taken for granted. We have 
developed solutions where some of the heterogeneity issues are addressed. Our fragment 
selection algorithm selectively divides a heterogeneous document collection into equi-sized 
fragments with full-text content. If the content is considered too data-oriented, it is not 
accepted. The algorithm needs no information about element n ... 

Keywords: XML retrieval, heterogeneous documents, indexing 



20 Expert assistance for manipulating of SGML document type definitions
W. Timothy Polk, Lawrence E. Bassham 

January 2000 Proceedings of the ACM conference on Document processing systems

1 aTool: creating validated XML documents on the fly using MS word
Oliver Meyer 

October 2002 Proceedings of the 20th annual international conference on Computer 
documentation SIGDOC '02 

Publisher: ACM Press 

Full text available: pdf(239.02 KB) Additional Information: full citation, abstract, references, index terms

This paper describes aTool, an extension to Microsoft's Word to create XML documents. 
aTool has been developed in a joint project of the publisher Springer Verlag, Technical 
University of Munich (TUM), and Technical University of Aachen (RWTH). It has been 
developed to provide Springer Verlag with uniform XML documents from its authors and 
has become a generic XML creation tool that can be adapted to different document 
structures. For an author, aTool derives XML structures from MS Word editing c ... 

Keywords: A-Posteriori Integration, DOM, Microsoft Office, Microsoft Word, Microsoft 
Word Add-In, XML, character formatting 



2 What makes the differences: benchmarking XML database implementations
Hongjun Lu, Jeffrey Xu Yu, Guoren Wang, Shihui Zheng, Haifeng Jiang, Ge Yu, Aoying Zhou
February 2005 ACM Transactions on Internet Technology (TOIT), volume 5 issue 1
Publisher: ACM Press

Full text available: pdf(589.14 KB) Additional Information: full citation, abstract, references, index terms

XML is emerging as a major standard for representing data on the World Wide Web. 
Recently, many XML storage models have been proposed to manage XML data. In order to 
assess an XML database's abilities to deal with XML queries, several benchmarks have also 
been proposed, including XMark and XMach. However, no reported studies using those 
benchmarks were found that can provide users with insights on the impacts of a variety of 
storage models on XML query performance. In this article, we report our ... 

Keywords: XML query processing, XML storage model, benchmark 



3 Logical definability and query languages over ranked and unranked trees
Michael Benedikt, Leonid Libkin, Frank Neven 

April 2007 ACM Transactions on Computational Logic (TOCL), volume 8 issue 2 
Publisher: ACM Press 

Full text available: pdf(601.84 KB) Additional Information: full citation, abstract, references, index terms
We study relations on trees defined by first-order constraints over a vocabulary that 




http://portal.acm.org/results.cfm?coll=ACM&dl=ACM&CFID=196126... 5/25/2007



Results (page 1): fix??? <paragraph> malformed <paragraph> ... Page 2 of 6 



includes the tree extension relation T&pr; T (holding if and only if every branch of T 

extends to a branch of T) f unary node-tests, and a binary relation checking whether the 

domains of two trees are equal. We consider both ranked and unranked trees. These are 
trees with and without a restriction on the number of children of nodes. We adopt the 
model-theoretic appro ... 

Keywords: Ranked trees, model theory, query languages, tree automata, unranked trees 



4 Session I: Design patterns as higher-order datatype-generic programs 
Jeremy Gibbons

September 2006 Proceedings of the 2006 ACM SIGPLAN workshop on Generic 
programming WGP '06 

Publisher: ACM Press 

Full text available: pdf(214.16 KB) Additional Information: full citation , abstract , references , index terms 

Design patterns are reusable abstractions in object-oriented software. However, using 
current mainstream programming languages, these elements can only be expressed extra- 
linguistically: as prose, pictures, and prototypes. We believe that this is not inherent in the 
patterns themselves, but evidence of a lack of expressivity in the languages of today. We 
expect that, in the languages of the future, the code parts of design patterns will be 
expressible as reusable library components. Indeed, we c ... 

Keywords: design patterns, folds, functional programming, generic programming, higher- 
order functions, unfolds 
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6 D ynamic labeling schemes for ordered XML based on ty pe information 
Damien K. Fisher, Franky Lam, William M. Shui, Raymond K. Wong 

January 2006 Proceedings of the 17th Australasian Database Conference - Volume 49 
ADC '06 

Publisher: Australian Computer Society, Inc. 

Full text available: pdf(184.30 KB) Additional Information: full citation , abstract , references , index terms

With the increasing popularity of XML, there arises the need for managing and querying 
information in this form. Several query languages, such as XQuery, have been proposed 
which return their results in document order. However, most recent efforts focused on 
query optimization have either disregarded order or proposed static schemes in which 
updates are not handled efficiently. Some dynamic labelling schemes have been proposed 
but they do not consider type information that is usually available w ... 

Keywords: XML, order maintenance 



7 Query optimization in XML structured-document databases
Dunren Che, Karl Aberer, Tamer Ozsu 

September 2006 The VLDB Journal — The International Journal on Very Large Data 

Bases, Volume 15 Issue 3 
Publisher: Springer-Verlag New York, Inc. 

Full text available: pdf(687.23 KB) Additional Information: full citation , abstract
While the information published in the form of XML-compliant documents keeps fast 
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mounting up, efficient and effective query processing and optimization for XML have now 
become more important than ever. This article reports our recent advances in XML 
structured-document query optimization. In this article, we elaborate on a novel approach 
and the techniques developed for XML query optimization. Our approach performs 
heuristic-based algebraic transformations on XPath queries, represented as PAT a ... 

Keywords: Deterministic query optimization, Query transformation, XML database, XML 
query optimization, XML query processing 



8 Articulating information needs in XML query languages

Jaap Kamps, Maarten Marx, Maarten de Rijke, Borkur Sigurbjornsson

October 2006 ACM Transactions on Information Systems (TOIS), volume 24 issue 4

Publisher: ACM Press

Full text available: pdf(318.47 KB) Additional Information: full citation , abstract , references , index terms

Document-centric XML is a mixture of text and structure. With the increased availability of 
document-centric XML documents comes a need for query facilities in which both 
structural constraints and constraints on the content of the documents can be expressed. 
How does the expressiveness of languages for querying XML documents help users to 
express their information needs? We address this question from both an experimental and 
a theoretical point of view. Our experimental analysis compares a stru ... 

Keywords: Full-text XML querying, XML retrieval, XPath 



9 TIPSTER architecture: TIPSTER text phase II architecture requirements
Architecture Committee 

May 1996 Proceedings of a workshop on held at Vienna, Virginia: May 6-8, 1996 

Publisher: Association for Computational Linguistics 

Full text available: pdf(1.34 MB) Additional Information: full citation , abstract

The requirements herein are derived from several Government agencies. Some 
requirements may be traced to specific documents given below. Interviews with 
Government personnel were also a source. When possible, the source documents, shown 
as (n), indicate the basis for the TIPSTER requirements. Derived Requirement. 1. BAA 93-36
and Scenarios 2. Architecture Requirements (draft), Sarah Taylor, 13 March 1994 3.
FBIS CONOPS (draft), MITRE, 28 February 1994 4. ADEPT CONOPS (working draft), MITRE,
4 Febru ...



10 Querying structured documents with hypertext links using OODBMS
V. Christophides, A. Rizk

September 1994 Proceedings of the 1994 ACM European conference on Hypermedia
technology ECHT '94
Publisher: ACM Press

Full text available: pdf(1.32 MB) Additional Information: full citation , abstract , references , citings , index terms

Hierarchical logical structure and hypertext links are complementary and can be combined 
to build more powerful document management systems. Previous work exploits this 
complementarity for building better document processors, browsers and editing tools, but 
not for building sophisticated querying mechanisms. Querying in hypertext has been a 
requirement since [19] and has already been elaborated in many hypertext systems, but 
has not yet been used for hypertext systems superimposed on an und ... 

Keywords: hypertexts, information retrieval, object oriented databases, path 
expressions, query languages, structured documents 
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Publisher: ACM Press 
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Often, independent organizations define and advocate different XML formats for a similar 
purpose and, as a result, application programs need to mutually convert between such 
formats. Existing XML transformation languages, such as XSLT and XDuce, are
unsatisfactory for this purpose since we would have to write, e.g., two programs for the
forward and the backward transformations in case of two formats, incur high developing
and maintenance costs. This paper proposes the bidirectional XML tran ...

Keywords: XML, tree automata 



12 Document reuse and semantics: Towards a semantics for XML markup
Allen Renear, David Dubin, C. M. Sperberg-McQueen 

November 2002 Proceedings of the 2002 ACM symposium on Document engineering 
DocEng '02 

Publisher: ACM Press 

Full text available: pdf(7Z89KB) Additional Information: full citation , abstract , references , citings , index terms

Although XML Document Type Definitions provide a mechanism for specifying, in machine- 
readable form, the syntax of an XML markup language, there is no comparable mechanism 
for specifying the semantics of an XML vocabulary. That is, there is no way to characterize 
the meaning of XML markup so that the facts and relationships represented by the 
occurrence of XML constructs can be explicitly, comprehensively, and mechanically 
identified. This has serious practical and theoretical consequence ... 

Keywords: SGML, XML, knowledge representation, markup, semantics 



13 Research sessions: Web, XML and IR: FleXPath: flexible structure and full-text
querying for XML

Sihem Amer-Yahia, Laks V. S. Lakshmanan, Shashank Pandit
June 2004 Proceedings of the 2004 ACM SIGMOD international conference on
Management of data SIGMOD '04
Publisher: ACM Press

Full text available: pdf(437.86 KB) Additional Information: full citation , abstract , references , citings

Querying XML data is a well-explored topic with powerful database-style query languages 
such as XPath and XQuery set to become W3C standards. An equally compelling paradigm 
for querying XML documents is full-text search on textual content. In this paper, we study 
fundamental challenges that arise when we try to integrate these two querying 
paradigms. While keyword search is based on approximate matching, XPath has exact 
match semantics. We address this mismatch by considering queries on structure ... 
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Data-intensive Web sites are large sites based on a back-end database, with a fairly 
complex hypertext structure. The paper develops two main contributions: (a) a specific 
design methodology for data-intensive Web sites, composed of a set of steps and design 
transformations that lead from a conceptual specification of the domain of interest to the 
actual implementation of the site; (b) a tool called Homer, conceived to support the site 
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design and implementation process, by allowing the ... 

Keywords: Databases, Internet, WWW, World Wide Web, development 
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Since the summer of 1973, when I became a Burroughs Research Fellow, my life has been 
very different from what it had been before. The daily routine changed: instead of going to 
the University each day, where I used to spend most of my time in the company of others, 
I now went there only one day a week and was most of the time (that is, when not
travelling!) alone in my study. In my solitude, mail and the written word in general
became more and more important. The circumstance that my employe ... 

16 Compiler construction: an advanced course 

F. L. Bauer, F. L. De Remer, M. Griffiths, U. Hill, J. J. Horning, C. H. A. Koster, W. M.
McKeeman, P. C. Poole, W. M. Waite, G. Goos, J. Hartmanis 
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The Advanced Course took place from March 4 to 15, 1974 and was organized by the 
Mathematical Institute of the Technical University of Munich and the Leibniz Computing 
Center of the Bavarian Academy of Sciences, in co-operation with the European 
Communities, sponsored by the Ministry for Research and Technology of the Federal 
Republic of Germany and by the European Research Office, London. 
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We propose the study of visibly pushdown automata (VPA) for processing XML documents 
VPAs are pushdown automata where the input determines the stack operation, and XML 
documents are naturally visibly pushdown with the VPA pushing onto the stack on open- 
tags and popping the stack on close-tags. In this paper we demonstrate the power and 
ease visibly pushdown automata give in the design of streaming algorithms for XML 
documents. 

We study the problems of type-checking strea ... 

Keywords: XML, pushdown automata, query, schema, streaming algorithms, typing 
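
The push-on-open-tag, pop-on-close-tag behaviour described in the abstract above can be illustrated with a minimal sketch. This is not the paper's algorithm: the event format, the function name `is_well_nested`, and the use of tag names as stack symbols are assumptions made only for illustration, and the input is assumed to be already tokenized into open and close events.

```python
from typing import Iterable, List, Tuple

# Hypothetical event format: ("open", tag) or ("close", tag).
Event = Tuple[str, str]


def is_well_nested(events: Iterable[Event]) -> bool:
    """Check tag nesting in one streaming pass over the events.

    The kind of each event alone decides the stack operation
    (push on "open", pop on "close"), which is the defining
    restriction of a visibly pushdown automaton.
    """
    stack: List[str] = []
    for kind, tag in events:
        if kind == "open":
            stack.append(tag)          # open-tag: push
        elif kind == "close":
            # close-tag: pop, and require it to match the open tag
            if not stack or stack.pop() != tag:
                return False
        else:
            raise ValueError(f"unknown event kind: {kind!r}")
    return not stack                   # every open tag must be closed


if __name__ == "__main__":
    doc = [("open", "a"), ("open", "b"), ("close", "b"), ("close", "a")]
    print(is_well_nested(doc))
```

Memory use is bounded by the nesting depth of the document rather than its length, which is what makes this style of processing suitable for streaming.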



18 Alloy: a lightweight object modelling notation
Daniel Jackson

April 2002 ACM Transactions on Software Engineering and Methodology (TOSEM),
Volume 11 Issue 2
Publisher: ACM Press

Full text available: pdf(346.87 KB) Additional Information: full citation , abstract , references , citings , index terms

Alloy is a little language for describing structural properties. It offers a declaration syntax 
compatible with graphical object models, and a set-based formula syntax powerful enough 
to express complex constraints and yet amenable to a fully automatic semantic analysis. 
Its meaning is given by translation to an even smaller (formally defined) kernel. This 
paper presents the language in its entirety, and explains its motivation, contributions and 
deficiencies. 

Keywords: Object models, Z specification language, first-order logic 
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This paper reviews the main innovations of XML and considers their impact on the editing 
techniques for structured documents. Namespaces open the way to compound documents; 
well-formedness brings more freedom in the editing task; CSS allows style to be 
associated easily with structured documents. In addition to these innovative features the 
wide deployment of XML introduces structured documents in many new applications 
including applications where text is not the dominant content type. In Ian ... 

Keywords: CSS, XML, authoring tools, compound documents, direct manipulation, 
structured editing, style languages 
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To explain complex phenomena, an explanation system must be able to select information 
from a formal representation of domain knowledge, organize the selected information into 
multisentential discourse plans, and realize the discourse plans in text. Although recent 
years have witnessed significant progress in the development of sophisticated 
computational mechanisms for explanation, empirical results have been limited. This paper 
reports on a seven-year effort to empirically study explanation ge ... 
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glycob.oxfordjournals.org/cgi/content/full/11/11/979 - Similar pages 
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